Comparative Bi-stochastizations and Associated Clusterings/Regionalizations of the 1995-2000 U. S. Intercounty Migration Network
نویسنده
چکیده
Wang, Li and König have recently compared the cluster-theoretic properties of bi-stochasticized symmetric data similarity (e. g. kernel) matrices, produced by minimizing two different forms of Bregman divergences. We extend their investigation to non-symmetric matrices, specifically studying the 1995-2000 U. S. 3, 107× 3, 107 intercounty migration matrix. A particular bi-stochastized form of it had been obtained (arXiv:1207.0437), using the well-established Sinkhorn-Knopp (SK) (biproportional) algorithm–which minimizes the Kullback-Leibler form of the divergence. This matrix has but a single entry equal to (the maximal possible value of) 1. Highly contrastingly, the bi-stochastic matrix obtained here, implementing the Wang-Li-König-algorithm for the minimum of the alternative, squared-norm form of the divergence, has 2,707 such unit entries. The corresponding 3,107-vertex, 2,707-link directed graph has 2,352 strong components. These consist of 1,659 single/isolated counties, 654 doublets (thirty-one interstate in nature), 22 triplets (one being interstate), 13 quartets (one being interstate), three quintets and one septet. Not manifest in these graph-theoretic results, however, are the five-county states of Hawaii and Rhode Island and the eight-county state of Connecticut. These–among other regional configurations–appealingly emerged as well-defined entities in the SK-based strong-component hierarchical clustering. PACS numbers: Valid PACS 02.10.Ox, 02.10.Yn, 89.65.Cd, 89.75.Hc ∗Electronic address: [email protected] 1 ar X iv :1 20 8. 34 28 v2 [ cs .S I] 1 4 Se p 20 12
منابع مشابه
Matrix plots of reordered bistochastized transaction flow tables: A United States intercounty migration example
We present a number of variously rearranged matrix plots of the 3, 107×3, 107 1995-2000 (asymmetric) intercounty migration table for the United States, principally in its bistochasticized form (all 3,107 row and column sums iteratively proportionally fitted to equal 1). In one set of plots, the counties are seriated on the bases of the subdominant (left and right) eigenvectors of the bistochast...
متن کاملA Further (Itakura-Saito/beta=0) Bi-stochaticization and Associated Clustering/Regionalization of the 3,107-County 1995-2000 U. S. Migration Network
Abstract We extend to the β-divergence (Itakura-Saito) case β = 0, the comparative bi-stochaticization analyses–previously conducted (arXiv:1208.3428) for the (Kullback-Leibler) β = 1 and (squaredEuclidean) β = 2 cases–of the 3,107-county 1995-2000 U. S. migration network. A heuristic, ”greedy” algorithm–using the β = 1 results as an initial configuration–is devised. While the largest 25,329 en...
متن کاملMultiscale Network Reduction Methodologies: Bistochastic and Disparity Filtering of Human Migration Flows between 3, 000+ U. S. Counties
To control for multiscale effects in networks, one can transform the matrix of (in general) weighted, directed internodal flows to bistochastic (doubly-stochastic) form, using the iterative proportional fitting (Sinkhorn-Knopp) procedure, which alternatively scales row and column sums to all equal 1. The dominant entries in the bistochasticized table can then be employed for network reduction, ...
متن کاملDendrogram/Regionalization of U. S. Counties Based upon Migration Flows
Abstract We have obtained a ”hierarchical regionalization” of 3,107 county-level units of the United States based upon 1995-2000 intercounty migration flows. The methodology employed was the two-stage (double-standardization and strong component [directed graph] hierarchical clustering) algorithm described in the 2009 PNAS letter arXiv:0904.4863. Various features (e. g., cosmopolitan vs. provin...
متن کاملOptimum Design of Liquified Natural Gas Bi-lobe Tanks using Finite Element, Genetic Algorithm and Neural Network
A comprehensive set of ten artificial neural networks is developed to suggest optimal dimensions of type ‘C’ Bi-lobe tanks used in the shipping of liquefied natural gas. Multi-objective optimization technique considering the maximum capacity and minimum cost of vessels are implemented for determining optimum vessel dimensions. Generated populations from a genet...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012